Nonlinear Laplacian spectral analysis: Capturing intermittent and low-frequency spatiotemporal patterns in high-dimensional data
Authors
Abstract
We present a technique for spatiotemporal data analysis called nonlinear Laplacian spectral analysis (NLSA), which generalizes singular spectrum analysis (SSA) to take into account the nonlinear manifold structure of complex data sets. The key principle underlying NLSA is that the functions used to represent temporal patterns should exhibit a degree of smoothness on the nonlinear data manifold M, a constraint absent from classical SSA. NLSA enforces such a notion of smoothness by requiring that temporal patterns belong to low-dimensional Hilbert spaces V_l spanned by the leading l Laplace-Beltrami eigenfunctions on M. These eigenfunctions can be evaluated efficiently in high ambient-space dimensions using sparse graph-theoretic algorithms. Moreover, they provide orthonormal bases to expand a family of linear maps, whose singular value decomposition leads to sets of spatiotemporal patterns at progressively finer resolution on the data manifold. The Riemannian measure of M and an adaptive graph kernel width enhance the capability of NLSA to detect important nonlinear processes, including intermittency and rare events. The minimum dimension of V_l required to capture these features while avoiding overfitting is estimated here using spectral entropy criteria. As an application, we study the upper-ocean temperature in the North Pacific sector of a 700-year control run of the CCSM3 climate model. Besides the familiar annual and decadal modes, NLSA recovers a family of intermittent processes associated with the Kuroshio current and the subtropical and subpolar gyres. These processes carry little variance (and are therefore not captured by SSA), yet their dynamical role is expected to be significant.
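The pipeline described in the abstract (graph Laplacian eigenfunctions, projection of the data onto their span, then a singular value decomposition) can be illustrated with a minimal NumPy sketch. This is not the authors' implementation. It assumes a dense, fixed-width Gaussian kernel in place of the sparse, adaptive-width kernel mentioned in the abstract, uses a diffusion-maps style normalization as a stand-in for the Laplace-Beltrami discretization, and omits time-lagged embedding and the spectral entropy criterion. The function name `nlsa_sketch` and the parameters `n_eig` and `epsilon` are hypothetical.

```python
import numpy as np

def nlsa_sketch(X, n_eig=4, epsilon=1.0):
    """Illustrative NLSA-style decomposition (assumptions noted in comments).

    X : array of shape (n_samples, n_dims), one snapshot per row.
    Returns spatial patterns U, singular values s, temporal patterns,
    the eigenfunction basis phi, and the discrete measure mu.
    """
    # Pairwise squared Euclidean distances between snapshots.
    sq = np.sum(X ** 2, axis=1)
    d2 = np.maximum(sq[:, None] + sq[None, :] - 2.0 * X @ X.T, 0.0)

    # Assumption: fixed-width Gaussian kernel (the abstract uses an adaptive width).
    K = np.exp(-d2 / epsilon)

    # Assumption: diffusion-maps normalization to reduce sampling-density effects.
    q = K.sum(axis=1)
    K1 = K / np.outer(q, q)
    d = K1.sum(axis=1)

    # Symmetric matrix similar to the Markov matrix P = D^{-1} K1.
    S = K1 / np.sqrt(np.outer(d, d))
    w, v = np.linalg.eigh(S)              # eigenvalues in ascending order
    order = np.argsort(w)[::-1][:n_eig]   # keep the n_eig leading eigenvectors
    v = v[:, order]

    # Eigenvectors of P, orthonormalized with respect to the measure mu.
    mu = d / d.sum()
    phi = v / np.sqrt(d)[:, None]
    phi /= np.sqrt(np.sum(mu[:, None] * phi ** 2, axis=0))

    # Linear map from the eigenfunction space to data space, then its SVD.
    A = X.T @ (mu[:, None] * phi)         # shape (n_dims, n_eig)
    U, s, Vt = np.linalg.svd(A, full_matrices=False)

    # Temporal patterns are linear combinations of the eigenfunctions.
    temporal = phi @ Vt.T
    return U, s, temporal, phi, mu
```

Restricting the temporal patterns to the span of the leading eigenfunctions is what imposes the smoothness constraint described above; increasing `n_eig` admits progressively finer structure on the data manifold.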
Similar resources
Comparing low-frequency and intermittent variability in comprehensive climate models through nonlinear Laplacian spectral analysis
Nonlinear Laplacian spectral analysis (NLSA) is a recently developed technique for spatiotemporal analysis of high-dimensional data, which represents temporal patterns via natural orthonormal basis functions on the nonlinear data manifold. Through such basis functions, determined efficiently via graph-theoretic algorithms, NLSA captures intermittency, rare events, and other nonlinear dynamical ...
Nonlinear Laplacian spectral analysis for time series: Capturing intermittency and low-frequency variability
Many processes in science and engineering develop multiscale temporal and spatial patterns, with complex underlying dynamics and time-dependent external forcings. Because of the importance in understanding and predicting these phenomena, extracting the salient modes of variability empirically from incomplete observations is a problem of wide contemporary interest. Here, we present a technique f...
Nonlinear Laplacian spectral analysis for time series with intermittency and low-frequency variability.
Many processes in science and engineering develop multiscale temporal and spatial patterns, with complex underlying dynamics and time-dependent external forcings. Because of the importance in understanding and predicting these phenomena, extracting the salient modes of variability empirically from incomplete observations is a problem of wide contemporary interest. Here, we present a technique f...
Limits of predictability in the North Pacific sector of a comprehensive climate model
We study limits of interannual to decadal predictability of sea surface temperature (SST) in the North Pacific sector of the Community Climate System Model version 3 (CCSM3). Using a set of low-frequency and intermittent spatiotemporal SST modes acquired through nonlinear Laplacian spectral analysis (a nonlinear data manifold generalization of singular spectrum analysis), we build a hierarchy o...
Nonlinear Laplacian spectral analysis of Rayleigh-Bénard convection
The analysis of physical datasets using modern methods developed in machine learning presents unique challenges and opportunities. These datasets typically feature many degrees of freedom, which tends to increase the computational cost of statistical methods and complicate interpretation. In addition, physical systems frequently exhibit a high degree of symmetry that should be exploited by any ...
Journal title:
- Statistical Analysis and Data Mining
Volume 6, Issue
Pages -
Publication date 2013